04. Comparison
Lesson 4 04 Comparison
Definition
When your problem statement requires the comparison of two data features or cohorts
_ Example_
Problem statement reads: What are the demographic differences between the Top 10% of FIFA players by market value and the remaining 90% of players?
Which visualization will work best?
- Box plots will provide a comprehensive picture of how the two cohorts are comparing:
- Center will tell if on average the cohorts are similar
- Spread will tell you if they vary differently
- Shape (symmetry, skewness) will indicate any asymmetry
- Unusual features (outliers, missingness)
If you need a refresher on box plots, feel free to explore the refresher course on descriptive statistics available in our pre-requisite courses within the classroom.